Search results

chapter

Reinforcement learning based throttle and brake control for autonomous vehicle following

Qi Zhu, Zhenhua Huang, Zhenping Sun, Daxue Liu, more

2017 Chinese Automation Congress (CAC) > 6657 - 6662

2017 Chinese Automation Congress (CAC)

In this paper, we focus on the basic form of autonomous follow driving problem with one leader and one follower. A reinforcement learning based throttle and brake control approach is developed for the follower vehicle. Near optimal control law is directly learned by “trial and error” with the neural dynamic programming algorithm. According to the timely updated following state, the learned control...

chapter

Human-in-the-loop reinforcement learning

Huanghuang Liang, Lu Yang, Hong Cheng, Wenzhe Tu, more

2017 Chinese Automation Congress (CAC) > 4511 - 4518

2017 Chinese Automation Congress (CAC)

This paper focuses on presenting a human-in-the-loop reinforcement learning theory framework and foreseeing its application to driving decision making. Currently, the technologies in human-vehicle collaborative driving face great challenges, and do not consider the Human-in-the-loop learning framework and Driving Decision-Maker optimization under the complex road conditions. The main content of this...

chapter

Comparison of reinforcement learning algorithms applied to the cart-pole problem

Savinay Nagendra, Nikhil Podila, Rashmi Ugarakhod, Koshy George

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 26 - 32

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Designing optimal controllers continues to be challenging as systems are becoming complex and are inherently nonlinear. The principal advantage of reinforcement learning (RL) is its ability to learn from the interaction with the environment and provide an optimal control strategy. In this paper, RL is explored in the context of control of the benchmark cart-pole dynamical system with no prior knowledge...

chapter

Power optimization using Markov decision process based on multi-parameter constraint modeling

Xiang Wang, Lin Li, Weike Wang, Pei Du, more

2017 International Conference on Circuits, Devices and Systems (ICCDS) > 68 - 72

2017 International Conference on Circuits, Devices and Systems (ICCDS)

Power optimization based on intelligent algorithm draws more and more attention. This article presents a novel low power optimization strategy based on the high level software power management employing Markov Process for charactering the real running workload. This article formulates workload characterization and selection with stochastic process method, and solves the formula using dynamic voltage...

chapter

Experimental study on decentralized concurrent learning for multi-agent system with complex dynamics

Ting Fei, Xin Chen, Min Wu, Chi Wang

2017 36th Chinese Control Conference (CCC) > 8373 - 8378

2017 36th Chinese Control Conference (CCC)

A cooperative multi-agent system entitles some independent agents to complete complex tasks through coordination and cooperation. Since the dynamics of physical agents are so complex that the environment of learning is indeed stochastic, the paper introduces the decentralized multi-agent reinforcement learning (MARL) algorithm, named as Decentralized Concurrent Learning with Cooperative Policy Exploration...

chapter

Simulation of intelligent traffic control for autonomous vehicles

Terje Kristensen, Nnamdi Johnson Ezeora

2017 IEEE International Conference on Information and Automation (ICIA) > 459 - 465

2017 IEEE International Conference on Information and Automation (ICIA)

Urban cities are getting more congested with vehicular traffic and most of the traffic control systems are not smart to detect and give priority to emergency vehicles. The effect results to inadequate services delivered by the public emergency agencies, and unnecessary traffic congestion to other road users at intersection points. In this paper we present an effective reinforced road traffic control...

chapter

An Autonomic Approach for the Selection of Robust Dynamic Loop Scheduling Techniques

Anthony Boulmier, Ioana Banicescu, Florina M. Ciorba, Nabil Abdennadher

2017 16th International Symposium on Parallel and Distributed Computing (ISPDC) > 9 - 17

2017 16th International Symposium on Parallel and Distributed Computing (ISPDC)

Parallel applications are highly irregular and high performance computing (HPC) infrastructures are very complex. The HPC applications of interest herein are timestepping scientific applications (TSSA). Often, TSSA involve the repeated execution of multiple parallel loops with thousands of iterations and irregular behavior. Dynamic loop scheduling (DLS) techniques were developed over time and have...

chapter

Monte-Carlo Bayesian Reinforcement Learning Using a Compact Factored Representation

Bo Wu, Yanpeng Feng

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 466 - 469

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Bayesian reinforcement learning provides an elegant solution to the optimal tradeoff between exploration and exploitation of the uncertainty in learning. Unfortunately, the size of the learning parameters grows exponentially with the problem horizon. In this paper, we propose a novel Monte Carlo tree search for Bayesian reinforcement learning approach using a compact factored representation, to solve...

chapter

CP-operated dash caching via reinforcement learning

Zhengyuan Pang, Lifeng Sun, Zhi Wang, Wen Hu, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 487 - 492

2017 IEEE International Conference on Multimedia and Expo (ICME)

In recent years, Dynamic Adaptive Streaming over HTTP (DASH) has gained momentum as an effective solution for delivering videos on the Internet. This trend is further driven by the deployment of existing HTTP cache infrastructures in DASH systems to reduce the traffic load as well as to serve clients better. However, deploying conventional cache servers in DASH systems still suffers from low cache...

chapter

Learn-as-You-Go with Megh: Efficient Live Migration of Virtual Machines

Debabrota Basu, Xiayang Wang, Yang Hong, Haibo Chen, more

2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS) > 2608 - 2609

2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS)

We propose a reinforcement learning algorithm, Megh, for live migration of virtual machines that simultaneously reduces the cost of energy consumption and enhances the performance. Megh learns the uncertain dynamics of workloads as-it-goes. Megh uses a dimensionality reduction scheme to projectthe combinatorially explosive state-action space to a polynomial dimensional space. These schemes enable...

chapter

A novel jamming strategy-greedy bandit

Shaoshuai ZhuanSun, Jun-An Yang, Hui Liu, Keju Huang

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1142 - 1146

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

In an electronic warfare-type scenario, an optimal jamming strategy is vital important for a jammer who has restricted power and how to make the optimal strategies quickly and accurately put on the agenda. In this paper, we developed a cognitive jammer who could learn the optimal jamming strategies with the proposed algorithm-Greedy Bandits (GB). By interacting with transmitter-receiver pairs continually,...

chapter

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Shixiang Gu, Ethan Holly, Timothy Lillicrap, Sergey Levine

2017 IEEE International Conference on Robotics and Automation (ICRA) > 3389 - 3396

2017 IEEE International Conference on Robotics and Automation (ICRA)

Reinforcement learning holds the promise of enabling autonomous robots to learn large repertoires of behavioral skills with minimal human intervention. However, robotic applications of reinforcement learning often compromise the autonomy of the learning process in favor of achieving training times that are practical for real physical systems. This typically involves introducing hand-engineered policy...

chapter

Information theoretic MPC for model-based reinforcement learning

Grady Williams, Nolan Wagener, Brian Goldfain, Paul Drews, more

2017 IEEE International Conference on Robotics and Automation (ICRA) > 1714 - 1721

2017 IEEE International Conference on Robotics and Automation (ICRA)

We introduce an information theoretic model predictive control (MPC) algorithm capable of handling complex cost criteria and general nonlinear dynamics. The generality of the approach makes it possible to use multi-layer neural networks as dynamics models, which we incorporate into our MPC algorithm in order to solve model-based reinforcement learning tasks. We test the algorithm in simulation on...

chapter

Can a reinforcement learning agent practice before it starts learning?

Minwoo Lee, Charles W. Anderson

2017 International Joint Conference on Neural Networks (IJCNN) > 4006 - 4013

2017 International Joint Conference on Neural Networks (IJCNN)

A reinforcement learning (RL) agent needs a fair amount of experience to find a near-optimal policy. Transfer learning has been investigated as a means to reduce the amount of experience required. Transfer learning, however, requires another similar reinforcement learning task as a transfer source, which can also be costly in the amount of experience required. In this research, we examine the possible...

chapter

Online control basis selection by a regularized actor critic algorithm

Jianjun Yuan, Andrew Lamperski

2017 American Control Conference (ACC) > 4448 - 4453

2017 American Control Conference (ACC)

Policy gradient algorithms are useful reinforcement learning methods which optimize a control policy by performing stochastic gradient descent with respect to controller parameters. In this paper, we extend actor-critic algorithms by adding an ℓ₁ norm regularization on the actor part, which makes our algorithm automatically select and optimize the useful controller basis functions. Our method is closely...

chapter

In-node cognitive power control in Wireless Sensor Networks

Michele Chincoli, Antonio Liotta

2017 IEEE International Conference on Communications Workshops (ICC Workshops) > 1099 - 1104

2017 IEEE International Conference on Communications Workshops (ICC Workshops)

Reliability, interoperability and efficiency are fundamental in Wireless Sensor Network deployment. Herein we look at how transmission power control may be used to reduce interference, which is particularly problematic in high-density conditions. We adopt a distributed approach where every node has the ability to learn which transmission power is most appropriate, given the network conditions and...

chapter

Adaptive State Space Partitioning of Markov Decision Processes for Elastic Resource Management

Konstantinos Lolos, Ioannis Konstantinou, Verena Kantere, Nectarios Koziris

2017 IEEE 33rd International Conference on Data Engineering (ICDE) > 191 - 194

2017 IEEE 33rd International Conference on Data Engineering (ICDE)

Modern large-scale computing deployments consist of complex applications running over machine clusters. An important issue there is the offering of elasticity, i.e., the dynamic allocation of resources to applications to meet fluctuating workload demands. Threshold based approaches are typically employed, yet they are difficult to configure and optimize. Approaches based on reinforcement learning...

chapter

Routing in dynamically changing node location scenarios: A reinforcement learning approach

Sudhir K. Routray, Sharmila K. P.

2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB) > 458 - 462

2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB)

Routing in dynamically changing node location scenarios is quite challenging and time consuming. The emerging wireless communication networks such as LTE advanced and 5G, device-to-device communications present such dynamically changing node locations. In mobile ad hoc networks, very often we come across such dynamically changing node location scenarios. In the Internet of things (IoTs), we will come...

chapter

Preventing instability in full echo Q-routing with adaptive learning rates

Maxim V. Kavalerov, Yuliya A. Shilova, Igor I. Bezukladnikov

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus) > 155 - 159

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)

A routing algorithm based on Q-routing paradigm is proposed for ad-hoc dynamically changing networks. The technique derived from Full Echo approach is used to enhance exploration capacity and prevent instability of routing under high load conditions. The performance of routing is increased by random polling of neighbors according to the local estimates of the average delivery time in the network.

chapter

Influence of the battery life parameter on the Q-routing algorithm results

Yuliya A. Shilova, Igor I. Bezukladnikov

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus) > 213 - 217

2017 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus)

There are several groups of routing algorithms in dynamically changing networks developing every year. We introduce an additional parameter “Battery Life decrease” in the existing Q-Routing protocol. The Battery Life is reduced in direct proportion to the number of packets transmitted to the node. The efficiency of the Optimized Battery Life Q-Routing protocol is estimated by total loss of network...

INFONA - science communication portal

Search results

Reinforcement learning based throttle and brake control for autonomous vehicle following

Human-in-the-loop reinforcement learning

Comparison of reinforcement learning algorithms applied to the cart-pole problem

Power optimization using Markov decision process based on multi-parameter constraint modeling

Experimental study on decentralized concurrent learning for multi-agent system with complex dynamics

Simulation of intelligent traffic control for autonomous vehicles

An Autonomic Approach for the Selection of Robust Dynamic Loop Scheduling Techniques

Monte-Carlo Bayesian Reinforcement Learning Using a Compact Factored Representation

CP-operated dash caching via reinforcement learning

Learn-as-You-Go with Megh: Efficient Live Migration of Virtual Machines

A novel jamming strategy-greedy bandit

Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates

Information theoretic MPC for model-based reinforcement learning

Can a reinforcement learning agent practice before it starts learning?

Online control basis selection by a regularized actor critic algorithm

In-node cognitive power control in Wireless Sensor Networks

Adaptive State Space Partitioning of Markov Decision Processes for Elastic Resource Management

Routing in dynamically changing node location scenarios: A reinforcement learning approach

Preventing instability in full echo Q-routing with adaptive learning rates

Influence of the battery life parameter on the Q-routing algorithm results

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options